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Abstract 



o 

J> 1 This paper reviews standard oversampling strategies as performed in the Mul- 

tiscale Finite Element Method (MsFEM). Common to those approaches is that 
' the oversampling is performed in the full space restricted to a patch but including 

coarse finite element functions. We suggest, by contrast, to perform local com- 
' putations with the additional constraint that trial and test functions are linear 

independent from coarse finite element functions. This approach re-interprets the 
Variational Multiscale Method in the context of computational homogenization. 
This connection gives rise to a general fully discrete error analysis for the proposed 
' multiscale method with constrained oversampling without any resonance effects. In 

pH . particular, we are able to give the first rigorous proof of convergence for a MsFEM 

■ with oversampling. 
C3 ■ 

Keywords a priori error estimate, finite element method, multiscale method, Ms- 
FEM, oversampling 

■ AMS subject classifications 35J15, 65N12, 65N30 

ON; 

1 Introduction 

t-h ; 

\ The numerical treatment of partial differential equations with rapidly varying and 

strongly heterogeneous coefficient functions is still a challenging area of present re- 
search, especially with regard to applications such as porous media flow or the trans- 
port of solutes in groundwater. In such problems, the occurring permeabilities and 
hydraulic conductivities have rapidly changing features due to different types of soil, 
microscopic inclusions in the bottom or porous subsurface rock formations. These fine 
scale structures are highly heterogeneous and typically they need to be accounted in 
the whole computational domain in order to obtain reliable approximations of impor- 
tant physical effects. Therefore, there is need for computational grids that resolve the 
fine scale variations. But when pore scale effects become relevant or when domains 
spread over kilometers, the computational load becomes extremely large and in several 
applications even too large to treat the problem with standard finite element or finite 
volume methods. This is just one instance of a so called multiscale problem as it arises 
in hydrology, physics or industrial engineering. 
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To avoid (or at least decrease) the computational issues that come along with mul- 
tiscale equations, there are a lot of methods that are specifically constructed to deal 
with this kind of problems. Typically, these methods decouple the global fine scale 
problem into localized subproblems which can be treated independently from each 
other and with comparably low costs. The list of meanwhile proposed multiscale meth- 
ods is long. Amongst the most popular methods are the heterogeneous multiscale 
finite element method (HMM), initially introduced by E and Engquist [EE03a] (see 
also |EE03bL IEE051 IAbd09| or the Variational Multiscale Method by Hughes et al. 
|Hug95j |HFMQ98| (see also |LM07l IMallll IMPllj ) 

In this paper, we deal with another popular method: the Multiscale Finite Element 
Method (MsFEM) proposed by Hou and Wu [HW97J and further investigated in several 
contributions [HWC99I lEHWOOl IEP031 IHWZ04| . There is an ongoing development of 
the method to apply it to various fields and equations. For instance, a MsFEM for 
nonlinear elliptic problems is proposed in |EHG04| . a formulation for two phase flow 
problems in porous media is presented in [EH07] , advection diffusion problems are 
treated in jDYX09] and an application to elliptic interface problems with high contrast 
coefficients is presented in |CGH10| . A survey on the method is given in the book by 
Efendiev and Hou [EH09] . There is vast literature of works devoted to the method but 
there are still open questions of strong interest. The most relevant issue is a rigorous 
error analysis of the method, in particular in the case of non-periodic microstructures. 

The essential idea of the MsFEM is to start the computations with a common finite 
element space with a coarse underlying grid. Then, the basis functions are modified 
to account for fine scale variations that cannot be captured by a coarse grid. More 
specifically, local fine scale computations are performed to determine so called corrector 
functions. These corrector functions can be added as local perturbations to the original 
set of basis functions of the coarse finite element space. 

However, it is well known that the classical MsFEM suffers from so called resonance 
errors, which are typically of order O(jj-), where e denotes a characteristic size of 
the small scale and where H denotes the mesh size of the coarse grid (c.f. [HWC99, 
HWZ04 ). This implies that the numerical error becomes large in regions where the 
coarse grid size is close to the characteristic length scale of the microscopic oscillations. 
There are two different explanations for this error. The first one is a mismatch between 
the boundary conditions imposed for the local fine scale problems and global behavior 
of the oscillatory exact solution (c.f [EHWOO ). The second explanation is due to size 
and geometry of the sampling patch (c.f. [HWZ04J). The averaged behavior in such a 
patch should be 'representative' so that we can speak about a perfect sample size. If 
this is not the case, the final approximation might be distorted. In the periodic setting, 
for instance, the sampling domain should be some multiple of the periodic cell. On 
triangular patches with cathetuses of the length of a period, this patch is only half a 
periodic cell (i.e. the patch has bad size and geometry) and lacks essential information. 
This yields a completely wrong approximation (c.f. [Henl2j ). 

In a lot of applications, such as oil reservoir simulations or the transport of solutes in 
groundwater, a characteristic micrscopic length scale e is unknown, cannot be identified 
or does not exist at all. In scenarios without a clear scale separation it is often impossible 
to predict whether or not we are in the regime of resonance errors and it is very likely to 
actually hit the problematic regime. Hence, the quality of the final approximation can 
not be determined unless resonance errors are eliminated. For this purpose, different 
oversampling strategies have been proposed. The fundamental idea of each of these 
techniques is to extend the local problems to larger patches, perform the computation on 
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these oversampling domains but feed the coarse scale equation only with the information 
obtained within the original smaller patches. This reduces the effect of wrong boundary 
conditions and bad sampling sizes. In this paper, we present the two major strategies 
for oversampling and discuss their advantages and disadvantages. On the basis of these 
considerations we propose a new strategy that overcomes the issues of the previous 
strategies and of which we can prove rigorous error estimates for the corresponding 
multiscale approximations. 

This contribution is structured as follows: In Section [5] we recall the classical formula- 
tion of the MsFEM without oversampling. The most popular approaches for oversam- 
pling are discussed in Section and in Section 2] we propose a new strategy for which 
we present a rigorous error analysis. 

2 The Multiscale Finite Element Method 

In this section, we state the setting of this paper and we establish the required notations. 
We recall the classical Multiscale Finite Element Method (MsFEM) as initially proposed 
by Hou and Wu |HW97j . 

2.1 Setting an notation 

Consider a bounded Lipschitz domain Q C K d with piecewise flat boundary and some 
matrix- valued coefficient A E L°°(J),Rg^) with uniform spectral bounds 7 m i n > and 

Tmax ^ 7mini 

a{A(x)) C [7min,7max] for almost all xe!l. (1) 
Given / E L 2 (S1), we seek the weak solution of 

-V • AVu = f inn, 
u = on dQ, 

i.e., we seek u E Hq(Q) := {v E H 1 ^) | v\gn = in the sense of traces} that satisfies 

a(u,v):= [ AVu-Vv= [ fv=:F(y) for all v E flo(O). (2) 
Jn Jn 

We consider two discretization scales H > h > 0. The coarse scale H is arbitrary 
whereas the small scale parameter h is constrained by the problem; it is assumed to be 
smaller than the characteristic length scales of the variations of the diffusion coefficient 
A. 

Let 7~h, Th denote corresponding subdivisions of Q into (closed) triangles (for d = 2) 

and tetrahedra (for d = 3), i.e., 0, = (J t = \J T. We assume that Th, Th are regular 

teT h TeTjj 

in the sense that any two elements are either disjoint or share exactly one face or share 
exactly one edge or share exactly one vertex. For simplicity we assume that Th is 
derived from Th by some regular, possibly non-uniform, mesh refinement. 
For T = Th,Ti, let 

Px(T) = {v E C°(n) | VT E T,v\ T is a polynomial of total degree < 1} 

denote the set of continuous and piecewise affine functions. 
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Accordingly, Vh := Pi(Th) H Hq(Q) denotes the 'high resolution' finite element 
space and the 'coarse space' is given by Vh '■= P\{Th) H Hq{Q) C Vh- For any given 
subset wcSlwe define the restriction of Vh to u) with a zero boundary condition by 



A general function in v E V^Th may jump across edges of the coarse mesh Th and, 
hence, does not belong to Hq(Q). However, the Tff-piecewise gradient Vhv, with 
(Vhv)\t = V(«|t) for all T E Tff, exists. Typically, MsFEM approximations obtained 
with oversampling are nonconforming approximations of the exact solution in the sense 
that they do not belong to Hq(O). 

In the following xt E T denotes an arbitrary point, for instance the barycenter of 
T. For &h E Vh and T E Th, the affine extension operator Et '■ Vh — > Pi(&) is given 



Finally, by xt we denote the characteristic (or indicator) function with xt(x) = 1 for 
x E T and Xt{%) = elsewhere. 

For the sake of simplicity, all fine scale compuatations are performed in subspaces 
of the finescale finite element space Vh- The Galerkin solution Uh E Vh which satisfies 



may, hence, be considered as a reference approximation. Note that we never solve this 
large-scale equation. The function Uh serves as a reference solution to compare our 
multiscale approximations with. The underlying assumption is that the mesh Th is 
chosen sufficiently fine so that Uh is sufficiently accurate. 

Throughout this paper, standard notation on Lebesgue and Sobolev spaces is em- 
ployed and a < b abbreviates an inequality a < C b with some generic constant 
< C < oo that may depend on the shape regularity of finite element meshes and 
the contrast 7max/7min but not on the mesh sizes H, h, and the regularity or the 
variations of the diffusion matrix A; a ~ b abbreviates a < < a. 

2.2 The classical MsFEM and reformulation 

We first present the classical MsFEM without oversampling as originally stated by Hou 
and Wu [HW97], and similarly by Brezzi et al. [BFHR97J. They proposed the strategy 
to enrich the set of standard finite element basis functions by fine scale information. The 
information is determined by solving local problems on the fine scale. We recall briefly 
the method and reformulate it in terms of a correction operator Qh and a corresponding 
corrector basis. 

Let N denote the dimension of the coarse space Vh and let {&i \ 1 < i < N} denote 
the usual nodal basis of Vh- Given some basis function <3?j, the corresponding MsFEM 
basis function <j>mspem ^ - g un iq Ue iy determined by the conditions that for all T E Th 




by: 



Et($ h )(x) ■= {x - x T ) ■ V$h(x t ) + <f> H (x T ). 



a h (u h ,v) = F(v) for all v e V h 



(3) 



and for all (fth E Vh(T) there holds 




MsFEM 



3>i on dT. 



(4) 
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The span of these MsFEM functions is called the MsFEM solution space 

yjMsFEM . = span {$MsFBM | 1 < j < j\T}. 

In particular, y™ M C V h C H^(U), because {$f FEM | 1 < i < N} defines a 
conforming set of basis functions. The classical MsFEM in Petrov-Galerkin formulation 
reads as follows: 

Definition 2.1 (MsFEM without oversampling). The MsFEM approximation u H sFEM G 
yMsPEM ig defined as t he solution of 

/ AVu H sFEM (x) • V$ H (x) dx = [ f(x)<$> H (x) dx for all $ H G V H . (5) 
Jn Jn 

With regard to the general framework for oversampling that we introduce in the 

subsequent sections, we note that the MsFEM can be rewritten in the following way: 

Remark 2.2. If u™ sFEM G y^ sFEM denotes the MsFEM approximation stated in Defi- 
nition [2TTJ then we have ■u^ sFEM = uh + Qh(un), where uh € Vh solves 



Pa) 



[ A (Vu H + VQ h (u H )) ■ V$ H = [ f$ H for all <$> H G V H , 
Jn Jn 

with 

d 

Q h {$ H ){x) := Y, d ^ H ^ WT ^ x ^ ^ 

T<=T H i=l 

and wt,i £ Vh(T) being the unique solution of 

f AVw T ,i ■ V(f> h = - f Aei ■ V(j> h for all fa G V h (T). §\c) 
Jt Jt 

The set of all functions wt,i is what we are going to call a local corrector basis. Note 

that from the computational point of view, it is cheaper to compute the corrector basis 

instead of directly computing the set of multiscale basis functions given by (HJ). The 

latter one involves more problems to solve. For instance if d = 2, the assembling of 

the corrector basis {wx,i \ T € Tn,i = 1,2} requires the solution of 2 • \Th\ local 

problems, whereas the solutions of 3 • \Th\ local problems are required to assemble 

I^msfem | 1 < i < N} by only using the coarse basis functions (for which we have 3 

per coarse element). 

The equivalence between the formulations ([5]) and ([6]) can be easily verified by the 
relation $¥ sFEM = Qh(<3?j). Observe that for every i, for every T G Th and every 

faeV h (T), 

j A{x) (V^(x) + VQ h ($i)(x)) ■ V(f>(x) dx 

d „ 

= V d Xi <f> H (x T ) / A{x) (a + Vw T (x)) ■ Vfax) dx = 

and $i + Q h (<Z>i) = $>i on dT, which is the definition of $f sFEM . 
A symmetric version of (|6Laf) is given by: find uh G Vh with 



I A{Vu H + S7Q h (u H )) ■ {V§h + VQh($H)) = I f$ H (7) 
Jn Jn 

for all &h G Vh- Note that (|6lap and ([7]) are identical, because 

/ A (Vu H + VQ h (u H )) • V0 fe = for all fa G V h (T). 
Jt 
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3 Oversampling strategies 

As already discussed in the introduction, the classical MsFEM in Definition 12 . 1 1 can be 
strongly affected by resonance errors which might lead to completely wrong approxi- 
mations of the exact solution (c.f. [EH09]). In the absence of scale separation or any 
knowledge about a suitable sample size for the local problems, the classical MsFEM 
needs a modification. Oversampling is considered to be a remedy to this issue. Over- 
sampling means that the local problems (|61ap are solved on larger domains, but only the 
interior information (i.e. we restrict the gained fine scale information to T) is commu- 
nicated to the coarse scale equation (|61ap . This reduces the effect of a wrong boundary 
condition imposed for the local problems. However, there is no unique way of extending 
the local problems (|61ap to larger patches. Different extensions lead to different over- 
sampling strategies. We now present the two common approaches for oversampling. 
We rephrase both approaches so that they fit into a common framework. We discuss 
the advantages and disadvantages of the methods and then we propose our new over- 
sampling strategy. Note that each of the subsequent strategies is a generalization of 
the case without oversampling. 

Let us introduce some additional notations. First we introduce the notion of an admis- 
sible patch. 

Definition 3.1 (Admissible patch). For T £ Th, we call U(T) an admissible patch of 
T, if it is non-empty, open, and connected, if T C U(T) C and if it is the union of 
elements of Th, i-e. 

U(T) = hit |J r, where Th* C T h . 

A given set of admissible patches is given by U, i.e. 

U := {U(T) | T £ Th and U(T) is an admissible patch}, 

where U contains one and only one patch U(T) for each T € Th- The set U(T) \ T 
is called an oversampling layer. The thickness of the oversampling layer is denoted by 
du,T '■= d\st(T,dU(T)). Furthermore, we define 

df} in := min du,T and <ig ax := max du,T 
the minimum and maximum thickness. 

In the spirit of (|61ap and ([7]), we now define the coarse scale equation for an arbitrary 
Multiscale Finite Element Method with a chosen oversampling strategy. As we will 
see later on, all MsFEM realizations only differ in the correction operator Qh that 
determines the oversampling strategy. 

Definition 3.2 (Framework for Oversampling Strategies). Let a = 1,2,3 denote the 
index of the oversampling strategy that we specify in the following sections. Let fur- 
thermore 

I l<i<d,Te Th} 
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denote a given local corrector basis that depends on the chosen strategy a (see (|6ja)- 
dUc) for the trivial case of such a basis). Now, Q^ ,a ■ Vh — > Vk.Th with 

d 

Q U h ' a ($ H )(x) := £ xt(x)^^^(xt)^(x) for $ H € V H (8) 

defines the (not necessarily conforming) correction operator. We define the MsFEM 
approximation u H +Q^ a (u H ), obtained with strategy a in Petrov-Galerkin formulation 
by: find u H £ Vh such that 




The MsFEM approximation u^j a + Q^' a (u^ a ), obtained with strategy a and a sym- 
metric formulation is given by: find u^j sym E Vh with 




= / + <#°(*ir)) 
./n 

for all &h £ Vff ■ Observe that strategies can only differ in the choice of the corrector 
basis. The remaining structure of the methods is always the same. 

In the next section we demonstrate how existing oversampling strategies fit into the 
framework presented in Definition 13.21 

3.1 Classical strategy initially introduced by Hou and Wu 

We start with describing the classical oversampling strategy as proposed by Hou and Wu 
|HW97j and further used and investigated in several works (c.f. |EHGn4llCS08llEHn9| V 
Let T G Th be fixed and let •••> ^d+il c Vh denote the basis functions 

that belong to the d nodal points in T. Hou and Wu [HW97| proposed the following 
oversampling strategy: solve for $J € Vh(U(T)) with 

/ AViJ-V^ h = for all <f> h e V h (U(T)) (11) 

JU(T) 

and the boundary condition $J = Et(&J) on dU(T) (where Et(<$>J) denotes the affine 
extension of (<3?J)|t)- Then, for a given coarse function G Vh, $^ sFEM is defined by 

d+l 
3=1 

where the Cj are such that & H aFEM (xj) = ^n(xj) for all d + 1 coarse nodes Xj of T. 
The final coarse scale equation reads: find u H sFEM E Vh f H with 

/ ^Vrf FEM -V^= ( f$ H (12) 

for all <&h € Vff- Note that u H sFEM is a nonconforming approximation of u. 
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Note that in [HWZ041 lEHWOO) . a slightly different condition is used to define the 
coefficients Cj. However, it turns out that this modified condition leads to nothing but 
Oversampling Strategy 2 below. 

We shall rephrase this multiscale method with oversampling strategy in the frame- 
work of Definition 13,21 For this purpose, let {<f>x, <p2 > •■•> 4^ I ^ denote the fine 
scale basis functions that belong to the (coarse) nodes in T G Th and let Qt{^h) '■= 
(JjMsfem _ E T (q) H ^ define local corrector to communicate the gained information to the 
coarse scale equation. The corresponding reduced fine scale space V£(U(T)) is given 
by 

K(U(T)) :=T4([/(r))\span{^,^,...,^ +1 }. (13) 

First, we observe 

Qt(^h)(x 1 ) = ^ FEM ( Xi ) - <S> H (xi) = for all nodes x, in T, 

which gives us Qt(<&h) G V[{U{T)). Next, by the definition of $m sFEM , we get that 
Qt($h) € V£(U(T)) is the unique solution of 



/ A(V$ H (x T ) + VQ T (®H))-V(f>h 
JU(T) 

[ A (VEt($ h ) + VQ t ($h)) • V<f>h 

JU(T) 

[ AV$J • V0 h = 0, 

JU(T) 



IU(T) 
d 

i=l 



for all (fth G V£(U(T)) (where the trial space might be also replaced by Vh(U(T))). 
Again, since V&h(xt) is a constant in U(T) we can rewrite Qt(&h) in terms of a 
corrector basis. This gives us the first definition of oversampling within our framework. 
We summarize the strategy as follows: 



Oversampling Strategy 1. Let V£(U(T)) denote the reduced fine scale space given 



by CL3]) and let G V£(U(T)) (for i G {l,2,..,d}) denote the solution of 



/ AVw^ i -V<i> h = - [ Aa-Vcfh for all <j> h G V^(U(T)). (14) 

JU(T) JU(T) 

For &h G Vh we define the corrector Q^^^h) G Vhj~ H by 

d 

Q U h\^h):= XT(x)J2d^H(x T )w U h ^.(x). 

TeT H j=i 

Now, u\ + Q^iujj), where ujj G V H is the solution of ([9]), defines the MsFEM 
approximation obtained with Oversampling Strategy 1, i.e. 

£ [ A(vu 1 H + VQ^ a (u 1 H )yv<S> H = f f$ H foraU^eVff. 

In particular, we have it^- sFEM = u\j + Q^' 1 ^)/) with it^ sFEM denoting the solution of 
(fT2|) . We therefore define 

1, MsFEM 1 . ^W.l/ 1 \ 

U H -= U H + Qh ( U H)- 
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Observe the equivalence of this formulation with the non-oversampling case for 
U{T) = T as stated in Remark 12.21 

Remark 3.3. The explicit boundary condition for the local problems is often 
missing in literature (c.f. [HW97} IEH 09 1 ) . However, it seems that these computations 
were performed for the case described above, i.e. the solution <&J of (|lip takes the 
values of an affine function on dU(T) (c.f. [EH W00[ IHWZ04| which also refer to the 
numerical experiments in |HW97j ). In some works (c.f. [EHG04!]) the local problems 
(|lip are formulated with the boundary condition of = on dU(T). This seems to 
be a mistake because if U(T) is larger than the support of the original basis functions, 
the new basis functions will be equal to zero. 

3.2 Oversampling motivated from homogenization theory 

The second type of oversampling is motivated from numerical homogenization theory. 
Assume that we regard 

find u £ G flo(n) with / A £ Vu £ ■ V$ = / /$ for all $ G H%(Sl) 

Jn Jo, 

and assume that A 6 is uniformly bounded and coercive in e, that A 6 is ^/-convergent 
to some matrix A and that u £ — 1 u° in H 1 ^), where u° G H£(tl) is called the 
homogenized solution. 

Then, a numerical approximation of the homogenized solution u° can be obtained 
by discretizing a more convenient equation (see (fT5"j) below). For this purpose, let 
B(x,rj) denote an open ball centered at x G Q with radius rj > and let N(x,rj) denote 
an open neighborhood of x G £1 with a Lipschitz boundary. It is assumed that there 
exist < c < C so that for all r] > and all x G there holds c\B(x,rj)\ < \N(x,r])\ < 
C\B(x,i])\. We seek u £ ^ G £#(fi) solving 

/ f{x)${x)dx 
Jn 

= [ {Nix,^- 1 [ A £ (y)(x7u £ ^(x) + V y Q(u £ ^)(x,y)) -V<S>{x)dydx (15) 

for <J> G Hq(CL), where for given $ G Hq(Q) the corrector Q(^f)(x, •) G Hq (N(x, rj + ()) 
is the solution of 

/ A £ (y) (V*(x) + V v Q(*)(x, y)) • V0(y) = 

JN(x,lj+0 

for all tfj G Hq(N(x, rj + £)). Then, if Q = Q{rf) and lim^o = 0, we have 

lim lim \\u° - ii e,r? ^|| uiic>\ = 0. 

In particular, we get 

lim lim \\u £ — u e ' v ^\\ rami = 0. 

This result was shown by Gloria |Glo061 IGI0O8] in a general nonlinear setting. Since 
U £ >V,C yields a good approximation of u £ , the result suggests to look at discretizations of 
(|15p . This was for instance exploited in |HQllj . A general numerical framework that 
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can be seen as a discretization of (|15p was proposed in [HQ 12] . Particularly, the HMM 
and the MsFEM are recovered from the framework, which leads to a straightforward 
oversampling strategy. This strategy can be formulated as follows (c.f. [Glo08[ IHQ12[ 
IHOS12] ): 

Oversampling Strategy 2. For i G {1,2, ..,d}, let G V£(U(T)) solve 

f AVw^ r V<p h = - [ Ae l -Vc/ )h for aU </> h G V h (U(T)). (16) 

JU(T) ' ' JU(T) 

and for G Vh, let Q fe ' ($#) G Vh,T H denote the corrector given by (JS]). Now, if 
Ujj G Vjj is the solution of © , then + Q l h' 2 { u %) defines the MsFEM approximation 
obtained with Oversampling Strategy 2. We therefore denote 

2, MsFEM 2 i /~,U,2/ 2 \ 

U H -= U H + Qh ( U H)- 

We immediately see, that Oversampling Strategy 1 and 2 only differ in the fine scale 
trial space for the local problems and that they are identical for U(T) = T, even though 
Strategy 2 was formulated independently of Strategy 1. In |HWZ04[ IEHW00] . Strategy 
2 is written in terms of an asymptotic expansion in the periodic case. Also note that 
this second approach is closely related to the Heterogeneous Multiscale Finite Element 
Method, where the same type of oversampling is used (c.f. [EE03a, EE03b, EE05, 
HOQjjJ). Notably, HMM and MsFEM can be reinterpreted in a common homogenization 
framework (c.f. |Glo06t fGloOS] ) and in a common numerical framework (c.f. [H Ol2| ). 

3.3 Discussion of the strategies 

As we just discussed, there are two widely used strategies for oversampling for the Ms- 
FEM. However, the difference between both approaches is only minor and the behavior 
of the resulting approximations appears to be qualitatively the same. The small differ- 
ence in the local trial spaces does not seem to have a significant impact. At least, the 
estimates available for strategy 1 are the same as for strategy 2 and vice versa. Fur- 
thermore, the available literature does not seem to distinguish between these strategies. 
For instance, [CS0 8] (using Oversampling Strategy 1) claims to generalize the results 
of [EHWOO] (using Oversampling Strategy 2). Such a mixture of strategies can be ob- 
served in several works on this topic. To the best of our knowledge, even though both 
approaches seem to behave identical, a rigorous proof of this conjecture is still missing. 
Strategy 1 suggests to fix the corrector Q^(tf>) in the corners of the coarse grid element 
T (forcing it to zero), whereas the corrector proposed by Strategy 2 does not have such 
a restriction leaving it completely free in these corners. 

Remark 3.4. As already mentioned, the MsFEM might be also considered in a sym- 
metric formulation (c.f. [EHWOOj), i.e. the coarse scale equation reads: find uh G Vh 
with 

£ / A (Vu% + VQ u h ' a (u H )) ■ (V$ H + VQ u h > a (<S> H )) 
TeT H T 

= [ f($H + Q U h ' a (<S>H)) 

Jn 

for all <f>H G Vh and where Q^ ,a is defined either with Oversampling Strategy 1 or 
2. However, the theoretical and numerical results in [HWZ04J show that this version 
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of the method still suffers from resonance errors. One explanation was suggested by 
Gloria [GI0O8] who proposed a simple computation: 



J A(yu% + VQ"' (t&)) ■ (v<& H + VC# a (<&* 
= f^A{yu a H + VQ^ a {u a H )) -V<S>h 

+ (yu% + VQ U h ' a (u a H )) ■ VQ u h ' a ($ H ) 
= J A (Vu% + VQ% a (u%)) ■ V$ H 

+ [ A hu% + VQ u h > a (u a H )) ■ VQ U h ' a ($ H ). 

JU(T)\T y ' 

This means that the effective MsFEM bilinear forms in Petrov-Galerkin and non- 
Petrov-Galerkin formulation differ in the term 

£ / A(Vu% + VQ U h ' a (uH))-VQ u h > a ($ H ), 

t^r JU(T)\T V / 



TeT H 



which still seems to contain the problematic boundary layers that we tried to get rid 
of. Observe that we integrate over the layer U(T) \ T. This is exactly the region we 
encounter unpleasant boundary effects of the correctors Q^' a {uH) and This 
might imply that the Petrov-Galerkin formulation should be preferred. 

Let us review the two essential results concerning the convergence of MsFEM ap- 
proximations with oversampling. The first result is due to Gloria and the most general 
result currently available for Strategy 1 and 2: 

Theorem 3.5. Let A e £ L°°(il, Mf*^) be a sequence of matrices with uniform spectral 
bounds 7 min > and 7 max > j min , 

o-{A £ {x)) C [7 min j 7max] for almost all 1 £ 1! and for all e > (17) 

and assume that A 6 is G-convergent. Let furthermore u e H € Vh denote the corre- 
sponding MsFEM approximation either obtained with Oversampling Strategy 1 or with 
Strategy 2 and let 

diam(C/(T)) " diam( -^ -> for H -> 0. 



Then we have: 



diam(T) 



lim lim \\u £ — u £ tt\\t2(q\ = 0. 



The proof is given in [GI0O8J , Theorem 6 and Remark 7. At first glance the result 
appears counter-intuitive in the sense that it suggests to let the oversampling converge 

to zero. However, the first limit is in e, which makes the relative thickness of 

"it 

the oversampling layer grow to infinity. Hence, the correct interpretation is that for 
fixed e the computational domains should blow up to infinity. In this case, the optimal 
corrector problem is an equation formulated on whole M. d . These corrector problems are 
exactly the cell problems known from periodic and stochastic homogenization theory. In 
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the periodic setting the classical cell problem can be extended to the M. d by periodicity 
and in the stochastic setting they are directly formulated in M. d to obtain the correct 
stochastic average (c.f. [JK094 ). 

Theorem 13.51 gives a clear message in the case of extremely small microscopic vari- 
ations. If e (the characteristic length scale of the fine scale oscillations) is (globally) 
sufficiently small then the resulting MsFEM approximation yields very good approxima- 
tions. This is a very important result, but it does not answer the question how (thick) to 
choose an oversampling patch. Especially we cannot predict how the method behaves 
if there is a large spectrum of oscillations without a scale separation. For instance, we 
might encounter variations, where it is hard to tell which of them are macroscopic and 
which are microscopic (i.e. 'e-dependent'). In practice, we do not construct an artificial 
sequence in e, we only have a given scenario and a given set of data. 

The next theorem due to Hou, Wu and Zhang is much more restrictive, but it gives a 
more quantitive answer than Theorem 13.51 

Theorem 3.6. Assume that d = 2, f £ L 2 (Q) and that A is a bounded, elliptic, sym- 
metric and e -periodic C 3 -matrix, i.e. A(x) = A p (^), with A p £ C 3 ([0, l} d , Rf^) being 
periodic. Let u £ H £ Vh,T H denote the MsFEM approximation obtained with Oversam- 
pling Strategy 2. Then: 

Uh - u E \\ L 2 (n) < C + H + e (logtf)*) , 

\ h 

£ \\Vu* H - Vu £ ||| 2(n) < C (j- + H + e^). 
K TeT H J Va « J 

A proof of this theorem is given in [HWZ04 . The assumption d = 2 seems to be 
essential for their strategy. Note that in [HWZ04J the theorem is formulated without 
the -i^nr contribution. Instead, the authors make the assumption that the oversampling 

layer is sufficiently large. Following their proofs one can easily see that the generalized 

estimate reads as above (c.f. [EH09| for the case = CH). In particular, the -gls- 

term describes the decay of the error between the exact corrector and the corrector with 

wrong boundary condition in a coarse element T. The decay turns out to be inverse 

proportional to the thickness of the layer. Because of the e scaling of the solution, the 

effective term becomes -S^. This seems to be a sharp estimate for the decay due to the 

a u 

findings in [EHWOOj and [CH03J . A proof of Theorem 13.61 for Oversampling Strategy 1 
can be achieved in the same fashion as in [HWZ04J. Theorem 13 . 61 predicts the following: 
if locally 0{H)=0{e) the patch size of the local problems must not be of order 0(H) 
to preserve convergence. Still, the theorem only gives an answer of how to choose the 
oversampling patches U if e is a known parameter. 

If the thickness of the oversampling layer is of order 0(h) both estimates in Theorem 
13.61 receive an order O(^) term and the right hand sides remain large. In general, the 
thickness d^ in must be large in comparison to e. Analytically, this implies that 0(H)- 
oversampling might be not enough in regions where we deal with resonance errors due to 
0(H)=0(e). This seems to show up in the numerical experiments in [HWZ04J, where 
the authors observe a stagnation in the convergence of the if terror for H entering the 
region with 0(H)=0(e). The effect on the L 2 -error is less strong. However, the value 
of dy 11 is missing in the experiments in [HWZ04J so we can only assume that dff n is 
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of order H. Otherwise the computation of the MsFEM basis functions becomes quite 
expensive. 

Remark 3.7. In [EHWOOj . the symmetric version ([10]) of the MsFEM is considered. 
Here, the derived L 2 -estimate reads: 



where u £ H denotes the MsFEM approximation of the symmetric problem (|10p deter- 
mined with Oversampling Strategy 1. Due to the numerical experiments in [EHW00] . 
C r seems to have a considerable size, so that this term is dominating the estimate if 
locally 0(e) =0(H). We notice that the estimate is worse then the L 2 -estimate for the 
Petrov-Galerkin version of the method, because the last term cannot be reduced even 
for large dy in . These observations are consistent with Remark 13.41 However, this leads 
to an additional problem of the MsFEM with Strategy 1 or Strategy 2. On the one 
hand, the Petrov-Galerkin version should be preferred over the symmetric version (see 
the estimates). On the other hand, the existence and uniqueness of the corresponding 
MsFEM approximations has not been established so far, not to mention stability. For 
the symmetric version, we can simply exploit the ellipticity of A to conclude that the 
method is well posed and stable. For the Petrov-Galerkin version there is no such argu- 
ment. The only result is a perturbation result due to Gloria fGlo08], saying that if the 
oversampling size is small enough (i.e. if the difference between Petrov-Galerkin for- 
mulation and symmetric formulation is small enough) then we still have existence and 
uniqueness. So far, we do not know of any generalization. The missing well-posedness 
of the Petrov-Galerkin MsFEM's for Strategy 1 and 2 is a remarkable issue of these 
approaches. 

The e-terms in the estimates that cannot be reduced with H ^ e should be seen as 
fixed modeling errors. They describe the error between exact solution and homogenized 
solution. In a general non-periodic non-stochastic scenario they cannot be quantified. 

In conclusion we have two findings. First, in general, both approaches do not show 
clear asymptotics for a convergence to the exact solution (for H y e). There is always 
a remainder of order e, even if U(T) = Q. In particular, this is a problem if e unknown 
or if the micro structure is heterogeneous. Second, if the modeling error of order e is 
negligible, Theorem 13.61 still suggests that linear convergence (with respect to H) can 
only be achieved if the oversampling thickness scales with 0(1) which makes the local 
problems prohibitively expensive. Since the estimate for the decay rate -jjjjj of the 

corrector error is sharp (in the periodic setting) we cannot hope for much improvement 
of the final error estimates stated in Theorem 13.61 

We may summarize the following issues that we address to solve with our new over- 
sampling strategy that we propose in the next section: 

a) elimination of resonance errors of any kind, 

b) clear prediction for the size of oversampling patches without explicit knowledge 
about the micro-structure or scale separation, 

c) construction of a conforming approximation in Hq(Q), 

d) a rigorous error analysis in H without restrictive regularity assumptions on the 
coefficients and for all space dimensions, 
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e) a-priori error estimation in the fully discrete setting (previous results were ob- 
tained under the assumption that the local problems are solved exactly), 

f ) formulation of a stable approach for which we can guarantee existence and unique- 
ness of the resulting MsFEM approximation, 

g) prevention of unstable splittings due to point evaluations as, e.g., required to 
implement the constraint in oversampling strategy 1 (cf. Definition of V£(U(T)) 
in (USD). 



4 Constrained oversampling 

In this section we introduce a third oversampling strategy for which we derive a rigorous 
a-priori error estimate. The results are presented in Subsection l4. 1 1 and a corresponding 
proof is given in Subsection 14.21 



4.1 New strategy and rigorous convergence 

In the following, let TV denote the set of Lagrange points of the coarse grid Th- F° r a 
given node z € N ' , & z € Vh denotes the corresponding nodal basis function. 

Our new approach is based on some multiscale decomposition of the space Vh, 

V h = V H ® W h , (18) 

where the space Wh contains the 'fine scale' functions of Vh, i.e., functions that are not 
captured by Vh- More precisely, we choose Wh to be the kernel of some Clement- type 
quasi-interpolation operator Ih ■ Hq(U) — > Vh, 

W h :={veV h \ I H (v) = 0}. (19) 

The operator Ih ■ Hq(Q) — > Vh is defined as follows |Car99j . Given v € Hq(Q), 
Ih v := Ylz£Af(lHv)(z)&z is determined by the nodal values 

(I H v)(z) := (20) 

for z G W. The nodal values are weighted averages of the function over nodal patches 
lo z := supp^. The operator is linear, surjective, bounded, and invertible on the 
finite element space Vh- Hence, the decomposition (|18p exists and is stable; it is even 
orthogonal in L 2 (fi). 

Given some element T € Th and an admissible patch U(T), we define to be 

the Clement-type quasi-interpolation operator with respect to the domain U (T) (with 
extension by zero in \ U(T)), i.e., I H v(z) is given by (|20|) for interior vertices 

TT( r P\ 

z € Af PI U(T) and (I H )v(z) = for all vertices z G M \ U(T). This superscript 

U(T) in the notation I H shall indicate that I H v satisfies homogeneous Dirichlet 
boundary conditions on dU(T), i.e., the definition (|2Up applies only vertices interior to 

U(T) 

the patch. Hence, I H v = does not imply Ihv = 0, in general. This subtle detail 
ensures that our new method boils down to the classical MsFEM in the case without 
oversampling. However, it generates some technicalities in the proofs later on. 
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Recall the (local) approximation and stability properties of the interpolation oper- 
ators Itf^ |Car99j : There exists a generic constant C such that for all v G Hq(Q) and 
for all K G Th with K C U k {T) it holds 

H T l \\v - I U H {T) v\\ L 2 {K) + \\V(v - I U H {T) v)\\ L 2 {K) < C\\Vv\\^ {uk) , (21) 

where ojk '■= U{K' € Th \ K' n K ^ 0}. The constant C depends on the shape 
regularity of the finite element mesh Th but not on the local mesh size Ht '■= diam(T). 

Remark 4.1 (Nodal interpolation). Since we consider a fully discrete setting, where 
corrector problems are solved in the fine scale finite element space Vh, we could have 
chosen nodal interpolation instead of Clement-type interpolation. The subsequent def- 
initions and results will be almost verbatim the same. However, nodal interpolation 
does not allow the estimate (|2ip with an /i-independent constant if d > 1. The best 
constant C = Cd{h) reads C2{h) = \og(H/h) and C^{h) = (H/h)' 1 depending on the 
spatial dimension d (c.f. [Yse86|). Since this constant enters basically all error esti- 
mates below, we would end up with an /i-dependence of the multiplicative constants in 
the final error estimates. In 2d this can be acceptable because the dependence on h is 
only logarithmic. 

With the decomposition ()18p we do not search the local correctors in the full fine 
scale space Vh, but only in the constrained space Wh- The advantage is the following: 
as stated in the previous section for oversampling strategy 1 and 2, the standard decay 
for the difference between the local correctors and the global 'exact' corrector is of order 
(see Theorem 13. 6p , but in the constrained space Wh we can achieve an exponential 

decay (cf. Lemma 14.91 below) . 

We now propose our new Oversampling Strategy: 

Oversampling Strategy 3 (Constrained oversampling). Let Wh denote the space 
given by (fTHj) and define 

W h {U(T)) := {v G W h | I U H [T) v h = and v h \n\u{T) = 0}. 
IA 3 

The local correctors G W h (U(T)) (for i € {1,2, ..,d}) are defined as the (unique) 

solutions of 

f AVw%* i -Vcl>h = - f A ei -V<f> h for all 4>h G Wh(U(T)). (22) 

JU{T) JT 

IA 3 

For general &h G Vh we define the correction operator Q h ' : Vh — > Vh by 

d 

Q u h *(<$> H ){x):= Y, Y^hMw^x). 

TeT H i=l 

Now, the global coarse scale approximation u H G Vh is the solution of (fTU|) . i.e. it 
solves 



= [ f{*H + Q U h\*H)) (23) 
Jn 

for all &h G Vh- The corresponding MsFEM approximation is given by 
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Remark 4.2. As for Strategy 1 and 2, in the special case U(T) = T (no oversampling) 
Oversampling Strategy 3 coincides with the classical MsFEM stated in Definition 12.11 
(and rewritten in ([6la-c)). In general, the crucial differences between the classical 
Strategies 1, 2 and Oversampling Strategy 3 are the following: 

a) The variational problem for the local corrector in Oversampling Strategy 3 is 
posed in the constrained space Wh(U(T)) whereas the classical corrector problem 
seeks the local corrector in the full space Vh(U(T)) restricted to the patch. 

b) The support of the integrals on the right hand side in (fT4"|) and (fT6|) is U(T). 
In our new version we use only the element T. This allows us to exploit nice 
summation properties of the local projectors, without using indicator functions 
Xt that lead to discontinuities. 

c) In the classical setting, the local correctors are restricted to the corresponding 
elements to derive the global corrector. For Oversampling Strategy 3, we simply 
sum up (weighted by the coefficients of the finite element function) the local con- 
tributions to get the global corrector. Note that our global corrector is conforming 
in the sense that its image is a subset of Vh C Hq(Q) whereas the classical setup 
leads to a non-conforming corrector. 

d) In Strategy 3, we do not use a Petrov-Galerkin formulation for the global problem 
(|23|) . Since A is assumed symmetric, a symmetric discretization appears more 
natural. Furthermore, we immediately inherit coercivity for the global bilinear 
form A s . This gives us existence and uniqueness of u H and the arising MsFEM 
approximation is well posed and the method stable. The typical disadvantage 
of the symmetric version which still suffers from resonance errors (which is why 
Petrov-Galerkin is typically preferred) does not remain for our strategy. 

Remark 4.3 (Perturbation of the right hand side). We might also replace the right 
hand side of (|23p by the term j Q f&H- This introduces only a perturbation of order 
11-^/11^(0) m the H terror. 

Remark 4.4 (Non-Symmetric formulation). As for the classical strategies, one might 
also consider the non-symmetric Petrov-Galerkin formulations: find u H £ Vh such that 

J A (V(u% + Q U h ' S (u H ))) ■ V<S> H = /*h for all $h G V H . 

or 

f A (V4) • V($ ff + Q U h ' 3 (<S> H )) = I f$H for all $ H £ V H . 
Jn Jn 

We note that Theorem 13.51 still holds for MsFEM approximations obtained with 
Oversampling Strategy 3. The reason is that Theorem 13.51 in particular covers the 
case without oversampling (see also [GI0O6J). In fact, the proof given in |Glo08| goes 
back to the arguments used for the case without oversampling. But for U = Th (no 
oversampling), all strategies 1,2 and 3 are identical. Therefore, the proof is similar for 
each approach. Especially concerning Strategy 3: if the thickness of the oversampling 
layer decreases faster than the coarse mesh size, we are almost in the case of Over- 
sampling Strategy 1, up to a small perturbation of the source term that is of order 
maxT G 7^ and that converges to zero under the assumptions of Theorem 13.51 
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In this sense, Strategy 3 also covers the homogenization setting established by Gloria, 
with u H converging to the homogenized solution as in Theorem 13. 51 

Besides obvious advantages of our new strategy mentioned previously, e.g., its confor- 
mity, stability, unique solvability, we now formulate the main convergence result which 
is proved in Subsection 14.21 



Theorem 4.5 (Rigorous a-priori error estimates). Let Th be given coarse triangu- 
lation and let U denote a corresponding set of admissible patches, with the property 
dy' n y Hlog^H -1 ). By Th we denote a sufficiently accurate fine triangulation of Q, 
where Uh denotes the associated finite element solution given by If u H MsFEM is the 
MsFEM approximation determined with Oversampling Strategy 3 and if u H denotes the 
corresponding coarse part, then the following a-priori error estimates hold true: 

\\Vu h - Vu^ MsFEM || i2(n) < CH, 

II 3, MsFEM 1 1 , fi TT 2 

IK - u h \\L 2 (n) < C H , 
IK - u h\\lHq) < CH. 

Here, C denotes a generic constant that depends on f , 7 m ; n and 7 max but not on the 
regularity of the exact solution or the variations of A. Details on the constants are 
given in Theorems \4-12 and \4-14\ 



4.2 Proof of the main result 

Before we prove the error estimates for the MsFEM with the correctors presented in 
Oversampling Strategy 3, we introduce some simplifying notations for this subsection. 

hi 3 ° 

Definition 4.6 (Notations for Oversampling Strategy 3). Let w h ' Ti € Wh{U(T)) de- 
note the local corrector basis given by (|22]). let Q^' 3 denote the corresponding corrector 
operator from Strategy 3 and let u H denote the arising (coarse) MsFEM approximation. 
In the following, we skip the redundant indices and use the following notation: 

i hi .3 ^-viY,3 3 i MsFEM § s \ / \ 

W T '■= w h,T,V Qh '-=Q h ' U H := U H and u --UH + Qh{UH)- 

The first lemma treats the (unpractical) case of maximal oversampling. 

Lemma 4.7 (Error estimate for maximal oversampling). Let U(T) = for all T 6 Th- 
Then the multiscale approximation uh that solves (|23|) satisfies the error estimate 

\\Vu h - V(u H + Qh(uH))hz(n) < 7 m l\\Hf\\ L 2 {n) , 

where Uh solves the reference problem ([3]). 

If, moreover, (f,Wh)L 2 (n) = f or a ^ fi ne scale functions Wh £ Wh, then uh + 
Qh(un) = Uh- 

Proof. For U(T) = $7, Qh maps onto the fine scale space Wh- Given &h £ Vh, it is 
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easily checked that Qh(^n) = Y1t&t h Si=i dxi^H{xT)w l T satisfies 
a(Qh($H),<t>h) 



Jn \TeT H i=i / 
Jn \r eTH i=1 J 

d ,. 
TeTn t=l 



n e7]j i=l Jt 
J2 j T AV$ H -V<t>h 



T£T H 
= -a($ H ,(f) h ) 

for all 4>h G W^. This means that is the orthogonal projection onto the fine 
scale space Wh with respect to the scalar product a(-, •). Hence we get the orthogonal 
direct sum 

Vh = V H ®± a W h , where V H := {$ H + Qhi^n) \ ®H € V H }. (24) 

Moreover, it holds Galerkin orthogonality, i.e., for := Uh — (uh + Qh{uh)) and for 
arbitrary $ H + Q h ($ H ) G Vff, 

a(e fcj + Qfc(^fr)) = «K> $h + Qh(^h)) ~ a{u H + Qh(u H ), $h + Qh(&H)) 

# 0. (25) 

The combination of and ([25]) shows that E and therefore I#(eft) = 0. We 
obtain 

TminllVe/J^n) < a(e h ,e h ) = a(u h ,e h ) 

= feh= f(e h - I H {e h )). 
Jn Jn 

The application of the Cauchy-Schwarz inequality on the element level and the estimate 
(I2ip for the interpolation error yield the assertion. □ 



Corollary 4.8. The new MsFEM is exact (up to the discretization error on the fine 
scale and oscillations \\H f\\L 2 (n) °f the right-hand side f) in the limit of maximal 
over sampling. This results holds true independent of the upper spectral bound 7 max and 
the variations of A. This is the next difference to the previous Strategies 1 and 2. 

Although the error estimate in Lemma 14.71 is encouraging, maximal oversampling 
is not feasible. We shall study the decay of the correctors away from element they are 
associated with. For all T G Th, define element patches in the coarse mesh Th by 

Uo(T) : = T, 

U k (T):=U{T' eT H \T' r\U k ^{T)^%} £ = 1,2,.... 
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Lemma 4.9 (Decay of the ideal correctors). Let U(T) = 0, for all T G Th in Over- 
sampling Strategy 3. Then, for all T G Th and all k G N, 

\\A 1/2 W T \\ LWkiT)) < e- r -"\\A 1 / 2 VwUvv), 

where r is a positive constant that depends on the square route of the contrast but not 
on the mesh size or the oscillations of A. 

The proof of Lemma UJ2 requires the definition of cut-off functions and an additional 
lemma. For T G Th and £, k G N with k > I, define ?7r,fc/ £ V# with nodal values 

Vr,k,e(z) = for all z G A/" n U k -t(T), 

riT,k,e( z ) = 1 for all z G A" n ($7 \ U k {T)) , and ( 2 7) 

Tft 

vr,kA z ) = j for a11 x eJ\fndU k - i+m (T), m = 0,1,2,... ,l 

Given some w £ Wh, the product r]T,k,iw is not in Wh in general. However, the distance 
°f ^Tjfc i w and Wh is small in the following sense. 

Lemma 4.10. Given w G Wh and some cutoff function r]T t k,£ £ Vff as in (|27p . t/jere 
exists some u} G \ Uk~£-i(T)) C suc/i i/iai 

||V(r^, M w; - t5)||r2 ( n) < ^ 1 ||Vu»|| i 2 (C7fc+2(T)Wfc _^ 2(T)) . 

Proof Fix some T G Th and G N and let r]i := r/Tfei- The operator : /^(fi) n 
C(O) — > y ft denotes the nodal interpolant with respect to the mesh Th- Recall its 
(local) approximation and stability properties 

\W(p-IhP)\\mt)<h t \\V 2 p\\LHt) and \\V(I h p)\\ L 2 {t) < \\Vp\\ L 2 {t) (28) 

for all quadratic polynomials p and all t G Th- We will use this estimate for the T%- 
piecewise quadratic function p = rjgw. Since V 2 % = V 2 u> = in every t G Th, we have 
that V^rjiw = Vr^ • Vw in t. 

According to [MP 111 Lemma 1], there exists some v G V h such that 

Ihv = Inhimw), \\Vv\\ L 2 (Q) < \\VlHh(vew)\\L 2 (n), and supp(w) C O \ U k -e-i(T). 



(29) 

Hence, w := Ih(rj£w) — v G \ Uk-t-i{T)). Since Inlhicw) = cIhw = for any 

c £ R, we set := I^tI" 1 J Ut % and get 



S/I H Ih(vew)\\ 2 L 2 {n) 




E 




TeTff: 




Tc£/ fc+1 (T)\C/ fe _, 




s E 




T6T H : 




Tc£7 fc+1 (T)\t/ fe _, 




s E 








TcU h +l(T)\U k - e 









rj£ - c T w 



2 

L 2 (T) 



L 2 (uj t ) 



\{^m)(w - I h w)\\ 2 L 2 {u1t) + (ri e -c^jVw 



2 

L 2 {u> T ) 



2 (!7 fe+ i(T)\!7 fc _,_ 1 (T)) 

TCl7fe + l( T )Wk-*-i( T ) 



L 2 (w T ) 



£ \\HVr] e \\ 2 Loo(n) \\Vw\\ 2 L2(Uk 



+ 2 (T)\u k . e . 2 (T))- 
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In the last step, we used the Poincare inequality with 

11% - 4lli«( WT ) ~ M^H^ - 4\\h(u T ) < ^ 2 |^Tr 1 ||V%||2 2(ajT) < H 2 \\V7] e \\l x{u]T y 

In summary we get with the previous computations: 

l29t 

\\^(.V£W -w)\\ 2 L 2 {n) < \\V(r]ew-I h (r]ew))\\ 2 L 2 {n) + \\VI H I h (r]iw)\\ 2 L 2 {n) 

l28t 

< \\hVru • Vw\\ 2 LHn) + \\HVrii\\l x , n) \\Vw\\ 2 L 2 (u 



T ||-" vi/<|| £oo(n )|| v <"\\L*(U k+2 (T)\U k -^ 2 (T)) 

(f27T).(f2Tt 

-r—T 1 1 2 II TTX~7 W'Z \ 1 1 V7 | 

" lL 2 (C/ fc+2 (T)\r/ fe _ (; _ 2 (T)) 



< (\\hVve\\ 2 L o* iu) + \\HVv£\\loo {n) ) \\Vw\ 12 
123 

< £ ll Vw llL2(C/ fc+2 (T)\C/ fc _,_ 2 (T))- 



This proves the assertion. □ 

Proof of Lemma \4.9[ The proof exploits some recursive Caccioppoli argument as in 
[MPllj . We fix some T G Th and HN. Given £ G N with £ < fc - 1, let % := 
f]T,k-2,i-i £ Vff be some cutoff function as in (|2"7|) . Lemma [4.101 shows that there exists 
some u;^ G W h such that ||V(r?^ - tWy)|| £ 2(n) < ^ 1 ||V^|| L 2 ( ; 7j= ( T )\ c/fe _ f ( T )). Observe 



/ 



V\U k -i{T) 



AVwip ■ SJwip = / AVi4 • Vz4 = Aa- Vw^ = (30) 

if) iT 



because of w l T G \ Uk-i+i(T)). The definition of 7%, the product rule, (f22|) . and 

(USD yield 



/ AVw^ • Vw % T < / r] e AVw l T ■ Vw l T 

Jn\u k (T) ' Jn\u k . e (T) 

< 



CP 



AVw l T • (V(%iOy) - wfpVr/g) 

n\U k _ e {T) 

/ AVu;^ • V{rjiw l T - w l T )) - (w l T - Ih(w 1 t ))\7t}i 

Jn\u^,(T) \ y — v — ' 



Observe that, by ([271), HV^I^^ = \^r]e(x K )\ < l~ l H^} for all K G Tff. This and 
the estimate ()2ip for the interpolation error show that 

- i^(t«r))VJ7/|lia (J0 < ||V77/|||oc (JO ||t4 - M^iOll!^) 

<r 2 ||v<4lli >K) 

for any K G Th- The combination of the previous estimates and Cauchy-Schwarz 
inequalities prove that there is some constant C\ > independent of T, £, k, and the 
oscillations of A such that 

IIAVaVti^H^ny^D) ^ c i r i^ 1/2 V^IU Wfc _,_ l( T)) (31) 

The choice £ := \C\ e] and the recursive application of (|31|) readily yield the assertion. 

□ 
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This exponential decay justifies the approximation of the correctors on local patches 
Uk{T) as proposed in ([26]) . We denote by Q k the corrector that corresponds to the 
choice U(T) = Uk(T) in Oversampling Strategy 3 and by the one for U(T) = ft. 

Corollary 4.11 (Truncation error). Let U(T) = SI for all T € Th in Oversampling 
Strategy 3. Then, for all T € Th and all k 6 N, 

ll^ 1/2 VW - 4 fc )IUw fc (T)) < e^WA^e^^, 
where r > is as in Lemma \4.9\ (independent of the variations of A or the mesh size). 
Proof. Galerkin orthogonality yields 

H^/^v^ _ ^ fe )||| 2(0) < n^i 1 /^^ _ ^)||i 2(c/fc iC:r)) + ||^ 1/2 v^ni 2(OWfe iC:r)) , 

where w € Wh is the finescale function that corresponds to (1 — Tfr,k-l,l) w T an< ^ wri ich 
is constructed in the same way as w in the proof of Lemma [4,10i Here, Tfrk—il ^ s some 
cut-off function as in ([27]) . Since supp(?i) C supp((l — 7/r,fc-i,i)f t) c ^k~i(T) we have 
that w G Vf/i(?7fe(r)) and the use of Galerkin orthogonality is justified. Proceeding as 
in Lemma |4. 101 shows that 

l^ /2 VK - 4 fe )lli* ( n) < P 1/2 V^||| 2Wfe _ 2( T)) 

and the application of Lemma 14.91 yields the assertion. □ 

Theorem 4.12 (H 1 error estimate). Given k € N, Zei J7(T) = U k (T) for all T £ Th 
in Oversampling Strategy 3. Then the multiscale approximation u H that solves (|23p 
satisfies the error estimate 

\\Vu h - V(u k H + Q k h {u k H ))\\ < ^J\Hf\\ L2{n) + e- rk \\f\\ H - 1{n) , 



where is the reference solution from ([3]) and r > as in Lemma \4-9 



Remark 4.13 (Relation to the results [MPllj). In the case of maximal oversam- 
pling, the new MsFEM with constrained oversampling coincides with the ideal version 
(without localization) of the variational multiscale method presented in [MPllj. The 
localized versions are different but allow similar error estimates. 



Proof of Theorem \4-l 6 A Let uh be the solution of ([23]) with the ideal corrector 
Q%. Then 

\\ A y\Vu h -V(u H + Qi(u H )))\\ LHn) 

< \\A^ 2 (yu h - V(u H + Q k h {u H )))\\mn) 

< \\AV 2 {Vu h - V(u H + Q%(u H )))\\ L2m 

+ \\A l / 2 {V(u H + Q%(u H )) - V(u H + Q k h (u H )))\\ 

< %Jl 2 \\Hf\\ LHQ) + P 1/2 V((Q^ - Q k h )(u H ))\\mn)- 
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By Corollary 14.111 we get 

\\AV 2 V((Q%-Q k h )(u H ) 



li 2 (n) 



(*) 
< 



< 



< 



< 



TeT H »=1 



10, 



I k - 



L 2 {n) 



TeT H *=l 

^-ar.* £ f:i^^(x T )| 2 ||^ ei ||| 2 
TeT ff i=l 

Ter ff i=i 

2r-fc|| /f||2 



2 



(T) 



k d e -2r-k 



k d e 



2 

L2(T) 



In the last step we have used that uh = Ih{uh + Qh( u H)), the stability of Ih and the 



energy estimate \\A 1 / 2 V(uh + Qh( u H)\\L 2 (T) 



< 



-1/2 



The step (*) would 



be obvious with k d replaced by the number of coarse elements. The presence of this 
-ff-dependent constant would not be crucial because it is easily compensated by the 
exponentially decaying factor. The refined estimate with multiplicative constant k d 
(which reflects the overlap of the patches) is possible because a{w l T — w 1 ^ , urL — w 



k ) 



is essentially zero for T and K at a certain distance. We omit the technical details and 
refer to |MP11[ Lemma 16] which carries over to the present setting using Lemma [4.10i 

□ 

Theorem 4.14 (L 2 -estimates). Given k £ N, let U{T) = U k (T) for all T G Th 
in Oversampling Strategy 3. Then the multiscale approximation u H that solves (|23p 
satisfies the error estimate 



\Uh 



(u k H + Q k h (u k H ))\\ L 2 



(fi) ~ (7 n 



\H\ 



+ k d '^ k f\\f\\ L , m 



and 



\Uh 



Uh\\l 2 (q,) 
"uh 



< min 



VHhHv + ^LlML^ + k^e-^) 2 



L 2 (Q): 



where u\ t is the reference solution from ([3]) and r is a positive constant. 

Proof. A standard Aubin-Nitsche duality argument yields the first estimate. The sec- 
ond estimate follows from the first one and the quasi-optimality and stability of the 
interpolation Ih in L 2 {VL). □ 



Remark 4.15 (Smooth coefficient with known smallest scale £.). Let be convex, 
/ 6 L 2 (Q) with ||/||i,2 ( fi) < 1, A e W 1 ' 00 ^) with \\VA\\ L oo {n] < e' 1 with some small 
scale parameter e > 0. Choose uniform meshes Th and Th with H > e > h. Under 
these assumptions, the error of the reference solution Uh £ Vh is bounded as follows, 



|V(u-0|| <her x . 
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We refer to see |PS12j for details. If k > log(i7 _1 ), Theorem ETTZl and Theorem I4TT41 
yield the error bounds 

\\W(u- U k H -Q k h (u k H ))\\ LHn) < H + ±, 
\\u-u k H -Q k h {u k H )\\ L , m < H 2 + (±)\ 
\\u ~ u k H \\ L 2 {n) < H+{±) 2 . 

5 Conclusion 

In this work, we proposed a new oversampling strategy for the Multiscale Finite Ele- 
ment Method (MsFEM) which generalizes the original method without oversampling. 
The new strategy is based on an additional constrained for the solution spaces of the 
local problems. The error analysis suggests to choose oversampling layers of thick- 
ness H\og(H~ l ) suffice to preserve the common convergence rates with respect to H. 
Moreover, this choice prevents resonance errors even for general L°° coefficients with- 
out any assumptions on the geometry of the microstructure or the regularity of A. In 
this respect, the method is reliable. The method is also highly efficient (as efficient 
as MsFEMs with classical oversampling) in the sense that structural knowledge about 
the coefficient, e.g. (local) periodicity or scale separation, may be exploited to reduce 
the number of corrector problems considerably or to reduce oversampling to very small 
layers. 
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